NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Leveraging Image Difficulty for Run-Time Adaptive DNN Inference on Embedded Devices

https://doi.org/10.1109/ISCAS56072.2025.11044244

Pentsos, Vasileios; Spantidi, Ourania; Zervakis, Georgios; Anagnostopoulos, Iraklis (May 2025, IEEE)

Free, publicly-accessible full text available May 25, 2026
Approximate Multiplier Mapping for Unfairness Mitigation in Energy-Efficient DNNs

https://doi.org/10.1109/ISCAS56072.2025.11043404

Spantidi, Ourania; Zervakis, Georgios; Henkel, Jörg; Anagnostopoulos, Iraklis (May 2025, IEEE)

Free, publicly-accessible full text available May 25, 2026
Late Breaking Results: Leveraging Approximate Computing for Carbon-Aware DNN Accelerators

https://doi.org/10.23919/DATE64628.2025.10993191

Panteleaki, Aikaterini Maria; Balaskas, Konstantinos; Zervakis, Georgios; Amrouch, Hussam; Anagnostopoulos, Iraklis (March 2025, IEEE)

Free, publicly-accessible full text available March 31, 2026
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization

https://doi.org/10.1109/TETC.2023.3346944

Balaskas, Konstantinos; Karatzas, Andreas; Sad, Christos; Siozios, Kostas; Anagnostopoulos, Iraklis; Zervakis, Georgios; Henkel, J¨org (January 2024, IEEE Transactions on Emerging Topics in Computing)

Deep Neural Networks (DNNs) have shown significant advantages in a wide variety of domains. However, DNNs are becoming computationally intensive and energy hungry at an exponential pace, while at the same time, there is a vast demand for running sophisticated DNN-based services on resource constrained embedded devices. In this paper, we target energy-efficient inference on embedded DNN accelerators. To that end, we propose an automated framework to compress DNNs in a hardware-aware manner by jointly employing pruning and quantization. We explore, for the first time, per-layer fine- and coarse-grained pruning, in the same DNN architecture, in addition to low bit-width mixed-precision quantization for weights and activations. Reinforcement Learning (RL) is used to explore the associated design space and identify the pruning-quantization configuration so that the energy consumption is minimized whilst the prediction accuracy loss is retained at acceptable levels. Using our novel composite RL agent we are able to extract energy-efficient solutions without requiring retraining and/or fine-tuning. Our extensive experimental evaluation over widely used DNNs and the CIFAR-10/100 and ImageNet datasets demonstrates that our framework achieves 39% average energy reduction for 1.7% average accuracy loss and outperforms significantly the state-of-the-art approaches.
more » « less
Full Text Available
Approximate Computing and the Efficient Machine Learning Expedition

https://doi.org/10.1145/3508352.3561105

Henkel, Jörg; Li, Hai; Raghunathan, Anand; Tahoori, Mehdi B.; Venkataramani, Swagath; Yang, Xiaoxuan; Zervakis, Georgios (October 2022, the 41st IEEE/ACM International Conference on Computer-Aided Design)

Full Text Available
Energy Efficient Edge Computing Enabled by Satisfaction Games and Approximate Computing

https://doi.org/10.1109/TGCN.2021.3122911

Irtija, Nafis; Anagnostopoulos, Iraklis; Zervakis, Georgios; Tsiropoulou, Eirini Eleni; Amrouch, Hussam; Henkel, Jorg (March 2022, IEEE Transactions on Green Communications and Networking)

Full Text Available

Search for: All records